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IDENTIFIERS 
ABSTRACT 

Two experiments were conducted to determine whether a 
meaning emphasis or a word identification emphasis would give better 
results in a third grade reading lesson. The results investigated 
more than 30 factors and their two-way interactions. In the first 
experiment, 259 children were placed into reading groups arranged 
either homogeneously or heterogeneously according to teacher judgment 
and standardized reading test scores. Each small group met 
individually with a researcher who conducted a reading lesson with 
either a meaning or a word identification emphasis. The second 
expejciment was a replication of the first, with 86 children 
participating. Results showed that lessons with an emphasis on 
meaning produced better results than those with an emphasis on word 
identification with both good and poor readers. Other findings 
include the following: il) in groups receiving the word 
identification emphasis, reading performance depended on 
instructional time; (2) in both groups, the child taking an active 
turn got more from the lesson than did the children who followed 
along; and (3) the level of interest of the material was a major 
factor in reading performance, far more imoortant than readability 
(FL) ^ 
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The Reading Group: An Experimental Investigation 
of a Labyrinth ^ 

There Is properly no history; only biography. Every mind must 
know the whole lesson for itself, must go over the whole ground. 
What it does not -,ee, what it does not live, it will not know. 
(Ralph Waldo Emerson, Essays , 18^2/19^15, p. 6) 

The Great Debate (Chall, I967) in beginning reading instf'uction is over 
the relative emphasis that ought to be given to decoding and meaning. The 
available evidence favors a decoding emphaSis. It appears that reeding 
programs that begin with explicit, direct instruction in spelling-sound 
correspondences are more successful than programs that rely on incidental 
learning of these correspondences (Pflaum, Walberg, Kanegianes, £■ Rasher, 
1980). However, It is possible that programs that include a substantial 
amount of direct instruction in spelling-to-sound correspondences are 
successful for other reasons. Such programs typically are more structured, 
provide more systematic feedback, allocate more time to reading, and main- 
tain hrgher levels of student engagement than meaning emphasis programs, 
whose advocates often believe that learning to read is a "natural process" 
in which it is unwise to intervene heavily (Goodman & Goodman, I979). Thus-, 
it can be argued that program evaluations and 'related teacher effectiveness 
research have underrepresented classrooms In which the instruction is both 
meaning-oriented and structured and systematic. There is at least one 
beginning reading program that features both a meaning orientation and 
systematic direct instruction, the Kamahamaha Early Education Project. It 
is thoroughly documented that this program achieves good results with 
at-risk minority chi Idren (Tharp, 1982). 
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If one were to grant that direct instruction designed to produce 
competence in fast, accurate word identification typically is best at the 
beginning,, an important policy question would still remain. No one doubts 
that the eventual goal is for children to read with comprehension. Tha 
question Is, therefore, at what point in a child^s development of reading 
proficiency should the schools stop stressing word identification andt)egin 
placing predominant emphasis on meaning? 

On the one hand, Venezky and Hassaro (1979) have doubted that more 

instruction in vjord identification ould ever be too much of a good thing. 

They concluded an article on the importance of rapid word recognition by 

saying they were no longer willing to agree, as they once did, with Chall 

(1967, p. 307) who they then quoted as follows: 

• Once the pupil has learned to recognize in print the words he 
knows (because they are part of his speaking ^nd listening 
vocabulary), any additional work on decoding on his part is a 
shear waste of time. 

Similarly, Perfetti and Lesgold (1977, 1979) have summarized research 
that shows large differences between good and poor readers in speed of 
pronunciation of unfamiliar words and pronounceable nonwords. This evidence 
establishes beyond any reasonable doubt that good and poor readers differ 
In their understanding of spelling-to-sound correspondences, whereas. In 
their research, Perfetti and Lesgold were unable to find evidence that good 
and poor readers differ substantially in use of higher level text informa- 
tion. From this they concluded that reading programs should include even 
more practice than typically given in accurate word pronunciation, more 
* drill in speeded word recognition, and mor^ practice in immediate memory 
for the facts in stories and texts. However, as Perfetti and Lesgold 
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acknowledged, attempts to improve reading comprehension u^ing speeded word 
drills have not yet proved very successful (Jenkins, Pany, s Schreck, 1978) 
On the other hand, there is a cornucopia of evidence 'that £ould be> > 
. . cited to support a stress on meaning in ^He reading program. Indeed, the 
benefits of an emphasis on meaning observed in basic research are so strong 
and so consistent that it is possible, in the fashion of scientists of 
another generation, to proclaim an empincal law. It might be called the 
Law of Meaningful Processing and it can be formulated as follows: Other 
things being equal, people ^earn, and remember more when conditions require 
them to understand the material. There is a problem in specifying exactly 
what is r^ant by terms such as "meaningful" and "understand" (Baddeley 
1978). but for the moment informal, ordinary language senses will" do. 
'^^ ^ °^ '^'^.^ '■^s^^'-ch illustrating the Law of Meaningful " 

Processing has involved recognition or recall of lists of word3. A study 
by Craik and Tulving (I975. E>i 1) is a good example, though convincing 
evidence from many earlier studies is available (e.g., Baddeley, I966; Hyde 
£ Jenkins, I969). Craik and Tulving had college students make judgments ' 
about a^series of words presented one at a time. The judgment tasks ranged 
• from those that did not demand.any meaningfu.l processing to ones thae 
demanded a lot. Four of the tasks wi 1 1 be described. The f i rst requi red 
only processing of graphemic information. The question was, "Is the word 
in capital (or lower case) letters?"--TABLE. The second task required 
- processing of phonemic information. The question was, "Does. the word 

rhyme with (for instance) weight?"- CRATE. The third task involved judging 
whether the word named a member of a certain category. Tl^e question was, 
"Is the word a type of (for instance) f lsh?"-SHARK. The fourth task 
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required determining whether a word could bejinserted In a certain'sentence. 
The question was (for instance), 'Vould the word fit in the sentence, 'He 

'"^'^ a the street' ?"--FR!END. The subjects thought the point of 

the experiment was to determine how quickly the judgments could be 
completed, but when they were finished a surprise test was given that 
required recognizing the -words that had been judged and discriminating them 
from other words. Pooling pver ^he words for which the right answers in 
the earlier judgment task had been "yes" and "no," the percentage of words 
recognized from the graphemi c, phonemic, category, and sentence tasks were 
16%, 51%, 78%, and 30%, respectively. 

A number of experiments have studied the effects of meaningful 
processing on sentence learning and remembering. For i I I us t rat loft we wi 11 
present several examples from our own research. Anderson and Hidde (1971) 
had college students rate either the pronouncibi I i ty or the image-evoking 
potential of a series of sentences and then gave a surprise recall test. 
Notice that creating an image certainly requires contact with meaning 
whereas a person can evaluate how easily a sentence rolls off the tongue 
without constructing a meaningful representation. Subjects who rated 
imagery recalled 65% of the sentences while those who rated pronouncibi 1 i ty 
recalled only 25%. Anderson, Goldberg, and KTdde (1970 and Kane and 
Anderson (1978) asked adult subjects to read aloud sentences such' as 
"Elevators stop at every floor." Half of the time there was a blank in 

\ 

the place qf the last word and the'word that fit the blank was highly 
predictable.. The hypothesis' was that filling blanks would increase 
learning since a reader cannot supply a missing word in a sentence of the 
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type Illustrated without actively bringing to mind a meanlngfulVepresenta- 

°^ °^ sentence, whereas pronouncing the sarpo sentence as 

a whole can- be done without accessing meaning. The results from several 
experiments have confirmed this hypothesis; subjects who filled blanks 
recalled about 10^ more sentences on the average. Closer to instruction ' 
, Is a study by Anderson and Kulhavy (1972). College students read defini- 
tions of difficult, unfamiliar concepts, such as. atavistic, cuprous, 
palliate., and xanthous, and then answered multiple choice items that 
required identifying examples' of the concepts. As they read, half of the 
subjects were instructed to create and say aloud a sensible sentence • 
containing each defined word. For instance, given the definition 
"Atavistic means reversion to a primitive type," one subject said, -Mayor 
Daley's politics are atavistic." The remaining subjects read each 
definition aloud three times, a task which took about the iame amount of 
time as composing a sentence. After one exposure to the definitions 
subjects who had composed sentences got S5Z of the test Items correct; 
whereas subjects who had orally repeated the def inI-tion''s got. ^ correct. 

Finally, research consistently indicates that condl tions thafpromote 
meaningful processing facilitate learning from connected text. For 
instance, Schallert (I976) found that subjects given directions to rate 
paragraphs for clarity/ambiguity or for difficulty, two tasks that require 
evaluating meaning, performed substantially better on recall and recognition 
tests than subjects who coi,nted four-letter words or counted number of 
personal pronouns, two tasks tha; do not requireUch contact with meaning. 
Some of the studies with text have obvious practical as well as theoretical 
implications. Watts and Anderson (1971) asked high school students, to 
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answer a question after reading each of several passages explaining psycho- 
I logical' principles. The question' involved identifying an example of the 

^ principle or the name of the psychologist associated with the principle/ "" 
■Students whose questions required them to apply the principles to new 
^ examples performed signi flcantly, better on a later test than aj 1 other " 
i-.i:udents, including students who received otherwise identical questions 
that repeated examples described in the, text. Students who answered name 
questions performed worst of all, poorer even than students who read with- 
out questions. Watts and Anderson argued that appl i cat ion- to-new-example 
questions induced students to process the passages in a deep," meaningful 
manner. In the same vein. Glover and his associates '(e.g. , Glover, Bruning, 
& Blake, 1982) have done a number of experiments showing better learninq 
and rememberi.ng of the information in a text when students are required to 
formulate logical extensions of the text, paraphrase the text, or judge 
inferences from the text, all activities that require meaningful processing. 

Wh,ile the facts are clear, just why meaningfOf processing facilitates 
learning' and remembering has been a matter of controversy. In an 
influential paper, Craik and Lockhart (I972) proposed what they called a 
"ievels-of-processing" theory. The essential idea was that every level of 
analysis of written language— graphemic, phonemic, semantic— leaves a trace 
in memory and the ^deeper" the analysis the more enduring the trace, this 
theory has been cri t i ci zed - for a*variety of reasons, notably because it 
is vague, indeed, little more than a restatement of the facts (BaddeJey, 

1978) . 'In subsequent papers, Craik and his colleagues have revi sed "the! r 
formulation and now speak of "distinctiveness" (Jacoby, Craik, s Begg, 

1979) and "rich semantic elaboration" (Craik S Tulving, 1975). 
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.The term '^semantic elabora^tion*' refers to the subject's tracing ^ 
connections between the presented materiaKand background knowledge. While 
research has coafirmed that elaboration plays role (Ross, 1981), it is 
now also clear tha-t the sheer amount of elaboration is not the key. Stein 
and Bransford (1979) found that subject^ were' si ightly worse at recalling 
core sentences such as "The fat man read* the 'si'gn'*- when the sentences were 

t 

elaborated as in, "The fat man read the sign th^t was two feet high.'-* Ih 
contrast', recall of the core sentences was substantially better when 
"precise elaborations," as ih "The fat man read the sign warning of the 
thin ice,"' vv^ere added. A precise elaboration clarified the significance 
of concepts in the core sentence and Indicated how the concepts fit 
together. An "imprecise elaboration," on the other hand, extended the 'core 
sentence «n an arbitrary manner; it did nothing to clarify the 'corjs ' 
sentence or show how the constituent concepts related to one ariother. 

A further insigh.t into meaningful processing has been f)rovi ded in 
research beginning wi th the studies of Anderson, Goldberg, and ttidde (1971) 
a 1 ready reviewed: It is important for the. reader to' be involved in an 
active efrort after meaning. Evidence cor'irmJng this point. has been • 
provided in a cleVer series of experiments by Auble, Franks, and Soraci 
(1979) who presented sentences such as "The party stalled because the wire 
straightened." Five seconds later a clue was presented th:at'helped the 
subject, figure out the Sentence's meaning. The clue was "corkscrew" in 
this case. Subjects recalled more sentences when they received this 
arrangement than when th^y spent the same amount of time reading a sentence 
that already embodied the clue word, in this case, "The party stalled 
because the corkscrew wire straightened." The investigators conclude(;l 
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that the results support the importance of an »»ah^" experience which 
entails the readers' making an active effort to move from a state of 
incomprehension to a state of comprehension. 

To summarize, while there^ ts still much to be learned, we know two of 
the boundary conditions that explain the inner workings of the Law of 
Meaningful Processing. First, learning is facil i tated when information is 
elaborated in such a way that all of the information is integrated into a 
coherent representation. Second, learning is faci I i tated when the reader 
is actively involved in attempting to generate a coherent representation. 
Stating these fundamental points in the elegant language of the nineteenth 
century American philosopher, Ralph Waldo Emerson (]8Wl9't5) "Every mind 
must know the whole lesson for itself, must go over the who.le ground. What 
it does not see, what it does not live, it' will not know" (p. 6). 

It 'is a most important fact th^t young children and- low ability persons 
are less likely than older or more able persohs to engage spontaneously in 
•rneaningful processing. One of several studies pointing to this conclusion 

f 

was done by Paris and Lindauer (1976). First and fifth graders listened to 
sentences such as "The workman dug a hole in the ground," or, alternatively 
"The workman dug a hole in'the ground with a shovel." Given the instrument 
-word, such as "shovel.," as the cue, the first graders recalled many more 
sentences when the instrument had been explicitly mentioned than when.it 
had been left implicit. In contrast, fifth graders recalled almost as many 
sentences/ in which the instrument used to perform some action w^b unstated 
as they did sentences in which it was stated. These resulljs appear to mean 
that the fifth graders spontaneously, drew inferences that meaningfully 
integrated the information in the sentences with background knowledge, while 
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the first graders did not. In another experiment, first graders acted out 
sentences they heard. Under these conditions their performance on 
sentences with unstated instruments improved sharply, presumably because 
they were now meaningfully- integrating the sentences with background 
knowledge. The evidence suggesting that young and low achieving children 
do not readily engage in meaningful processing provides support for an 
emphasis on meaning among young readers, and some grounds for the fear that 
a steady diet of %vord i dent i fj cat ion drill will produce children who are 
nothing but "word cal ler^." , 

It is often said that .classroom instruction is a "complex" process. 
Provided this is more than a code word for the belief that instruction Is 
a mysterV beyond the pale of empirical investigation, we take the view that 
instruction is complex to mean that interactions are expected among aspect^ 
of teaching method, setting, materials, and student characteristics. Very 
little instructional research has been designed so that it could illuminate 
interactions. 'Experimental investigations typically study the effects of 
at^ost one or two factors witb everything else allegedly held constant.' 
Systematic classroom observation studies generally attempt to measure many 
factors, but the data are often aggregated by classroom and are usually 
aggregated over weeks or even Months of instruction during which e&ch 
teacher has used different techniques and a variety of kinds of materials 
and each student has played various roles and engaged in various activities.. 
If instruction truly .is a complex process,, then the trends^ that could 
emerge in aggregated data would only be a dim reflection of the interplay • 
forces at work. 
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In elementary school classrooms, the largest portion of children's 
reading instruction takes place when the teacher works on reading with small 
groups of students. Particularly in the first three or four grades, the 
children usually are divided into several groups according to ability. 
While one group works with the teacher, the others complete workbooks and 
exercise sheets at their seats. Though the sequence is not usually this 
complete or orderly (Mason, I982), the chi Id.re^ in the reading group are 
introduced to the new words in the day's basal l^der story and may be 
helped to develop some story background; they may r^ad the, story silently, 
but more often they take turns reading it aloud with corrections of mistakes 
as needed by the teacher; they answer questions about the story; they 
occasionally receive direct instruction on. some aspect of reading; and 
finally they receive directions for seatwofk. 

Although there are many questions about the effectiveness of the small- 
group reading lesson, its complexity has meant that the critical issues of 
concern to reading educators have never been sat! sfactori ly examined. This 
is despite the fact that there have b_3n great strides in research on 
teaching in the past decade. One problem is, as Duffy (I98I, p. I13) has 
noted, that "reading research and teaching research have been moving forward 
'out of earshot' of each other. Research on reading reflects little of 
what has beeh discovered about teaching; the research on teaching reflects 
few of the findings about the reading process." 

The research reported in this paper was concerned with both the process 
of reading and teaching effectiveness. Four classes of variables were 
investigated, namely aspects of teachiia method, setting characteristics, " 
attributes of materials, and the reading proficiency and other personal 
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characteristics of the children. A noteworthy feature of the research 
described here is that interactions among all four classes of variables 
were systematically examined. 

The major purpose of the present research was to try to provide a 
partial empirical answer to the question of under what circumstances reading 
instruction ought to emphasize 'meaning to a greater extent than word 
Identification. The children were third graders who ranged widely in 
reading ability and the materials ranged widely in difficulty. Thus, the 
study had the potential to pinpoint the level of reading ability of children 
and level of readability of materials at which a meaning emphasis may get 
better results than h word identification emphasis. 

For half of the children who participated in the study, meaning was 
stressed. This was done by requiring the children to pre'dict what might 
happen next after reading each of a number of sentences. For instance, 
after reading the sentence, "The stupid child ran into the street after the 
ball," n^ost children said, "He got hit by a car," "He almost got hit by a 
car," or sometimes, "His mother screamed at him." Inconsequential miscues 
during reading were ignored. If a child made a serious miscue", the teacher 
supplied the word and the child went right on. 

With the remainder of the children, accurate word identification was 
emphasized instead of sentence content. Whenever a ghild mispronounced a 
word, the teacher supplied the word, and the child repeated it and finished 
the sentence. Then the child repeated the whole sentence. If there were 
> still miscues, the teacher corrected them again and the child had to repeat 
the sentence another time. This process contini-ed until the child gave the 
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sentence a fluent, word-perfect reading. Since the material was rather 
difficult for third graders, many poor readers had to repeat some sentences 
two or three times. 

Over thirty factors and their two-way interactions were studied in 
this research. One reason so many factors were investigated is that if 
reading -instruction is truly complex, then the questions about a meaning 
emphasis or a word identification emphasis cannot be answered independently 
of questions about other factors. While a great variety of. issues of 
potential interest to reading educators were explored, for the sake of 
clarity of exposition, the discussion in the remainder of this paper Is 
organized around five questions, including the major one about a meaning or 
a word identification emphasis. These questions are as follows: 

1. Which is more effective in a small-group, third grade reading 
lesson, an emphasis on meaning or an emphasis on accurate 
word identification? 

2. Does homogeneous grouping contribute to the effectiveness 
of reading lessons? 

3. In a small-group reading lesson, does the child who is tai<ing 
an active tu. n get more from the lesson than the children who 
are fol lowing along? 

Which is more important, the readability or the interesting- 
, ness of the materials? 
5. Do the teaching methods, grouping arrangements, and materials 
that are most successful depend vpon the reading ability'^and 
other personal characteristics of 'the child? " 
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Experiment ] 

Method 

Subjects. The subjects were 264 third grade students from five schools 
In Urbana, Illinois. Four students were lost when their reading group was 
disrupted by a guest musician in class. An additional student was lost 
when he was unable to furnish a recall because he was called out of class 
to another part of the building. This brought the total number of subjects 
used in the major analysis to 259, 129 boys and 130 girls. 

Only 161 of these children could be given a test measuring personality 
characteristics. This was due in part to, attrition , since the personality 
test was given at a different time, but mainly to school administration 
concerns about possible parental objections to the test." 

independent group of 53 third grade students from 
Rantoul, Illinois, 31 boys and 22 girls, rated 72 sentences as to how 
interesting they were. From these ratings, the 36 sentences used in the 
study were chosen on the basis of their overall interest value and on their 
differential interest to boys and girls. The final list contained nine 
sentences that were interesting to botf. boys and girls (e.g., "The hungry 
children were in the kitchen helping mother make donuts'-), nine sentences 
that were uninteresting to both groups (e.g., 'The old shoes were put away 
in the back of the closet"), nine sen"tences that- were more interesting to 
■boys than ^irls (e.g., "Green blood ran out whe^n the boy shot an arrow 
through the monster's head"), and nine sentences that were rr^re interesting 
to ..girls than boys (e.g., "The crowded schoolyard was full of girls getting 
ready for the jump rope contest"). 
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The sentences were written to vary widely i n . readabi 1 i ty and at the 
same time to l<eep readability independent of interest. Average readability 
was deliberately made high for third graders, so that there would be 
opportunities in the word identification groups for teacher intervention, 
feedback, and repetition. The average readability of the sentences on 
the Fry scale (Fry, I978) was k.3 and the range was from first to seventh 
grade. 

In order to make the position of sentences in the lesson independent 
of other sentence characteristics, threo blocks of sentences were formed. 
Each block consisted of a randomly chosen one-third of the sentences from 
each of the four interes^t categories describerd above. The sentences wftRfn" 
each block were then arranged in tl^ree separate random orders. Using the 
three block orders and the three randomization orders of sentences within 
each block, nine counterbalanced sentence pres'entation orders were formed. 
Nine sets of x 8i booklets were assembled. Each set contained one of 
the presentation orders of the sentences. Each page of a booklet displayed 
three of the sentences. 

The major dependent variable in this study was recall of the sentences 
from the lesson. In order to make the position of a sentence in the recall 
test independent of its position in the lesson, and independent of other 
sentence* characteristics, three additional blocks of the 36 sentences were 
formed. Each of these three blocks contained a randomly se I edted third of 
the sentences from each of the^ three sentence presentation blocks. Nine 
coynterbalanced recall orders were then developed, using the same procedure 
as with the presentation orders. 
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The standardized reading comprehension test was from the Metropolitan 
Achievement ^est. The reading fluency test consisted of four lists of 15 
unrelated words, adapted from Mason (1976), which the child read as fast 
as possible. A fifth list was constructed by selecting 16 difficult words 
from the 36 sentences used in the lesson in order to assess the effects of 
the lesson on speed and accuracy of word identification. A 58 question 

» 

personality test was developed by adapting items from other tests used to 
measure introversion-extroversion (Eysenck S Eysenck, 1975), test anxiety 
(Hill, 1980), and internal locus of control (Crandall, 1975).^ 

Procedure 

Teachers were asked to rank order every student in their classes in 
terms of their judgments of reading ability. These ranked lists were then 
sectioned into quartiles and used to select four-child groups either 
heterogeneous or homogeneous iif reading ability. Heterogeneous groups were 
created by. selecting one student from each quart! le while all four students 
in the homogeneous groups were from the same quartile. Whereas the groups 
were constructed based on teacher judgment, the measure of group ability 
used in the data analysis was a composite that gave equal weight to teacher 
judgment, standardized reading comprehension score, and word reading fluency 
score. 

Initially each, c I ass room was randomly designated for eithec hetero- 
geneous or homogeneous grouping, but it was' sometimes impossible to 
construct more than one or two complete homogeneous reading groups from 
each quartile. When It wasn't possible to place all students from a given 
quartile into homogeneous groups the raniaining students were placed into 
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heterogeneous groups. Additional children were taken from other third 
grade classes in the same school if they were needed to complete a reading 
group. 

The experimental reading instruction was conducted during the class' 

regular reading period. The experimental teacher, a male research 

assistant in his late 20's, called four students that formed a reading group 

to a quiet corner of the classroom where they were seated in a semicircle 

in front of him. .The group was assigned to recejve either a meaning or 

word identification emphasis according to a pred,etermi ned, random schedu le. 

Initial instructions to the group were the same tor both the meaning and 

word identification treatments, as follows: 

I have some sentences for you to read today. \ You'll take turns 
reading the sentences out loud, just like you .do when you're in 
• your reading group in class. This means that when one person 
Is taking their turn reading, out loud, the. rest of you should 
follow along in your own booklet reading each sentence silently 
to yourself. Please don't say -anythir - or make any nbise when 
another person is reading because we w ■ to make sure that we 
hear clearly every word that the person who is reading says. 

_ The teacher then chose booklets containing one of the nine sentence 
presentation lists according to a predetermined, random schedule and handed 
one booklet to each student.^ Instructions for the treatment condi^tions < 
were given it this point. Students to receive a meaning emphasis were toJd: 
When you read the sentences I want you to read each sentence so 
that you know what it's about. We'll be asking you some questions 
about the sentences later. As you read each sentence, I want you 
to think of a little story that the sentence could be part of, and 
think of what might happen next after the sentence. Then tell us 
what wi 1 1 happen next. 
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Instructions for the word identification treatment were: 

When you read the sentences, I want you to read each sentence 
carefully. We'll be asking you some questions about the 
sentences later. As you read each sentence, I want you to try 
to say every word in the sentence exactly right and clearly. 
If you make a mistake on a word, I'll help you, and then you 
should read that word again so that it's exactly right. You'll 
need to pay very close attention to all the words. 

Both groups were given two practice sentences. All four students in the 
group were given the opportunity to read each of the practice sentences. 
In the meaning condition each student was asked to relate to the group what 
he or she believed would happen next, while\^n the word identification con- 
dition the teacher emphasized to each student the importance of reading 
each word carefully and saying it exactly right. 

_ After answering any questions from the students, the teacher assigned 
a student to begin reading.-- Who would read first was based on seating 
position. Seating positions were labeled one through four with the student 
on the teacher's far left in position one'and the student on the teacher's 
far right in position four, Tlie starting point rotated from the seating 
position one to seating position four, returning tp pos i t ijajj one after every 
fourth reading group. Students then took turns reading, the' sen tences-'- 
^orally in '^Vound rob it>" 'fashion. A turn was 'defrned as' read'ing the'threS 
sentences on one page of the experimental booklet. 

In the word identification treatment, the teacher promptly corrected 
every reading miscue that the oral reader made and then had the student 
repeat the word and reread the whole sentence. Even if the child sponta- 
neously corrected an error, he or she had to reread the sentence until one 
perfect rendering of the sentence was achieved, when there was a long 



The Reading Group 
19 

■ hesitation, the teacher supplied the word, and the child finished the . ' 
sentence and then read it again. Performance was praised by saying "that's 
right" or "good reading." " 

During meaning emphasis instruction, miscues were ignored unless the 
meaning of the sentence was disrupted, in which case the teacher supplied 
the word and the child went right on. When necessary, the teacher prompted 
for a continuation to the sentence by saying "and then what happened?" 
Performance was praised with "very good" rather than "that's right," in 
order to encourage reflection about possible meanings instead of a percep- 
tion that there was a "correct" answerl 

Upon completion of reading instruction, each student from i he group 
went to one of four, research assistants to receive the fluency test and 
Ihe recall test. To. introduce the fluency test the research assistant said: 
I have five lists of words for- you to read today. J wi 1 1 hand ' 
■you one^ist afa time. ,Your task is ,to read the words out loud. . ' 
< to me as fast as you can making as few mistakes as you can.' If ' 
there's a word you can't read, just say/'skip it" and go on. 
Don't stop to try to figure out words you don't 'know. ■ 
The experimenter recorded all reading miscues and used a stopwatch to time ^ 
the students al they; read each^word Hst_. Ti;i,e was rec6rdecf to tenths of.' 



a ''second I 



Following the fluency test, the research assistant administered a test 
that required the student to recall the 36 sentences. One of the nine 
ordersMor sentence recall was assigned to the child based on a predeter- 
mined, random schedule. The "due" for each sentence consisted of the 
subject noun and its modifier. For warm up, t!.e student was giver dues 
to the two practice sentences used initially. For the practice sentences 
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me 



only, the clue was elaborated with a question such as, "Can you tell 
what the 'fat cook' did In the sentence you read earlier?" in order to mal<e 
sure the chi Id understood the task. The child's answers were marked on a 
form that listed all of the sentences. The assistant crossed out the parts 
of a se,ntence that a child failed to mention and wrote down any words the 
child included that did not match those in the sentence. 

Following recall the student was thanked for participating and sent 
back to class. A few weeks following reading group Instruction the 
Metropolitan reading comprehension subtest was administered to each class. 
The entire class was tested as a group in its regular classroom by one of 
the research assistants. On another day the test measuring the various 
personality characteristics of the child was administered, again in the 
regular classroom by one of the assistants. 

A simple, objective method of scoring sentence recall was employed in 
'the main data analysis. The number of content words th^t "a child repro- • 
duced from a sentence was counted. This count included not only words 
recalled verbatim but also synonymous words artd phrases. Then, sirtce the 
sentences varied in length, the number of content words recalled was 
divided' by the number of content words in 'the' -s entente, (not including the 
subject noun phrase which compri^sed the clue). The child's recall of the 
sentences was expressed as a percentage. A subsidiary analysis used a more 
Jenient scoring system in which a child got full credit for recalling the 
gist of any part of a sentence. 
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- Results and Discussion 

Overview of Analysis 

Table I includes the mean, standard deviation, and range for each 
variable except derivative variables such as the fluency of the oral 
reader, the interest value of the prior sentence, the quadratics of 
continuous variables, and interactions. Table 2 summarizes the analysis 
involving the principal dependent variable, percentage of s'entence recall 
Factors are listed in order of entry into the equation. This table con- 
tains just the "reduced" model, which Was compiled by deleting all non-' 
significant interactions and rerunning the program. ^^Iso deleted from 
the reduced model were nonsignificant main effects, provided the factors 
were not involved in any significant interactions. Al together, about 200 
main effects and interactions were examined. The examined terms include^ 
all two-way interactions except those among child characteristics (and 
those involving one other factor). None of the possi.ble 1»060 three-way 
interactions and none of the interactions involvingvfou-r or more factors 
was included. There was no theory to guide the selection of higher-order 
teri^is. Aimless exploration would have vitiated the power of the study-, 
and yieWed nothing -excejit a large computer bill. 



Insert Tables I and 2 about here. 



No computer program was available that could handle the entire analysis 
in one pass. It was necessary to run a trunca'ted form of the complete model 
that included l^iO variables, delete variables representing nonsignificant 
interactions and quadratic terms, add the remaining 60 variables, and run 
the program again. The program used was BMD-P2R. 

2o 



Table 1 
.Description of M'easures 



• 

Variable 


Measurement 
Scale/Method 

* • 


Meaning of High, 
Positive Score 


Mean< 


■ 


SD 


•* 
• 

Range' 

Smallest Largest 
Value Value 


Sentence Reca 1 1 


Percentage 


Good Recal 1 


28.51 




.30 


2 




Instruct ional Time, 


Mi nutes 


Long 


13.09 


3 


.79 


• 6 


22 


Word .Errors 


Freauencv 


1 ICI 1 1 Y 


^.01 


A 
4 


• /j 


0 


16 


Word Reading Time 


Seconds 


Slow 


26.18 


22 


.I'l 


6.i» 


1-3^.3 


Rea d i ng Comp rehen s i on 


Nat iona l 
Stanine 


High 

0 


5. OA 


2 


.05 

1 


I ' 


9 


Reading Fluency 


Local 
Stanine 


• 

Fluent 


5.03 


I 


.93 


■-.75 


9.02 . 


Sex 


Contrast 
Coded 


Boy 


.00 


I 


.00 


-1 


1 


Ethnici ty 


Contrast 
Coded 


White 






.89. 


., r . , 


1 


1 nt rovers i on- Ext rovers i on 


Sel f-Report - 


Extroverted 


12.03 


2 


.80 

t 


6 


20 


Anxiety 


Sel f-Report 


Not anxious 


8.5^ 


3 


57 


1 


\k 


Responsibility Accepted 


Sel f- Report 


High 


8.37 


I 


90 


3 ^ . 


12 


Blame Accep* ed 


Sel f-Report 


High 


5,^7 


2 


32 ' 


0 


1 1 
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Table 1 ((Tont'cr) 



\rariable 



Attention Span 

Meaning vs. 

Word Ideotif icat ion 

Group Abi 1 i ty 
Group Homogeneity 
Active Turn? 



, Cue-Response 
Association 

Readabi I i ty 



Position in Lesson 
Position in Test^ 
Total Interest 
Differential Interest 



Measuremltit Meaning of l+igh, 
Scale/Method Positive Score 



Range 



Mean 



SD 



Smal lest Largest 
Value • Value 



Percent i Le 
Rank, 

Contras^t 
Coded 

Loca 1 
Stanine' 

Standard 
Deviation 

Contrast 
Coded 

Adult 
Ra t i ng 

Grade 

Equivalents 

Serial 
Posi ticn 

Serial 
position 

Peer 
Rat Ing 

Peer 
Ra t i ng 



Long * 

Mean! ng 
Emphas is 

High 

Heterogeneous 

>^ . / 

Yes 



Highly 
Associated 

Diff icul t 



End- 
End 

Interest ing 
Girl Oriented 



51 
-.03 
5.02 
1.18 
-.50 
k,20 
k.28 
18.50 
18.50 
1.52 
.06 ■ 



29 
1 .00 

1.27 
1.12 

.87 
1.27 
1.68 

10.39 
10.39 
.88 

M 



2.1 
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1.0 
1 



1 



.27 
.61 



98 ... 
1 

7.k 

2.25 

I 

6.93 

7.0 
36 
36 

2.82 
1.10 
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- Table 2 

Summary of Regression Analysis for^xperiment 1 



Variable 



Between Subjects 

Reading Comprehension 
Reading Fluency 
Sex 

Meaning vs. Word Identification 
Group Abil i ty 

Meaning vs. Word Identification x 
Group Abi 1 r ty 

Constant/Residual 



Regression Coefficients 



Between 



^1.22 
- .87 

1 .92 
-8.ii8 
-2.21 

2.26 
22.75 



Total 



2.h3 

- .35 
1.99 
2. 80 

- .73 



% Variance 



10.97 
1 .69 
1.7^ 
3.81 
2.66 

3.80 
75.33 



35.56 
5.^8 
5.63 

12.3^ 
8.63 

12.31 



Wi thin Subjects 

Cue-Response Association 
Readab i I i ty 
Position in Lesson 
Position in Test 
Total Interest 
, (Total Interest)^ 
Differential Interest 
Active Turn 
.Reading Fluency of Active Turn Taker 
Position rn Lesson x Active Turn » 

23 



5.^2 
.01 

- .07 

- .^5 

7.87 
-2.63 
11. A8 
.52 
-1.65 

..12 



5.11' 
.07 
.01 
•.21 
1 .65 
.09 
.00 
.52 
.17 
.10 



^198. 80 
7.09 
1.13 
20.27 
161.01 
8.90 
• .07 
50.67 
16.89 
lo". 13" 
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Table 2 (cont.) 



Variable 


Regress ion 
Dctween 


Coefficients 

Tn fa 1 
1 0 La 1 


■xi va r 1 dnce 


r 

£L 


Position in Le<;<;on x Total lntprp<;t 




• 1 T 


no 


ft on 


Mean i no v*; Word 1 Hpn 1 1 fi rat i nn 

* l^ul llll^ W 3 * Yi\j lU IxJdlLI 1 IktfOLIUIl 

X Readabi 1 i ty 




-1 .20 


.23 


r 

22.63 


Meaning vs. Word Identification 

X Reading Fluency of Active Turn Taker 




2.26 


\28 


27.0 


Meaning vs. Word Identification 
X Differential Interest 




•* 

1 .03 


.07 


6.5U 


Group Ability x Differential Interest 




-2.35 


.18 


18.02 


Sex X Differential Interest 




-3.91 


.2k 


23.19 


Reading Comprehension x Total Interest 




.65 


.10 


9.i»6 


Constant/Residual 




- .10 


90.88 





Note ; The degrees of freedom for between-subject factors are 1 and 2^^; the degrees of freedom for 
within-subject factors are 1 and 8865. An £ of 8.18 is significant at the .01 level, two- 
tailed. ^ 

/ 
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The program was run In strictly hierarchical order.. In the portion of 
the analysis in which the total variance was partitioned, the variables 
were entered in blocks as follows: within-subject factors, quadratics of 
selected within-subject factors, interactions among within-subject factors, 
between-subject factors, and, finally, interactions between wi th in-subject 
and between-subject factors. Within blocks, order of entry was based on 
logic or theory. In cases in which no a priori grounds were discernible, 
variabl.es were entered stepwise in order of variance accounted for. The 
between-subject factors were evaluated in a separate analysis; they were ■ 
included in the total analysis In order to get proper estimates of inter- 
actions of within-subject and between-subject factors and appropriate beta 
weights. 

The F_ ratio for each factor was constructed off-line based on the 
2 

increment in R at the point of entry. Naturally, between-subject factors 
were evaluated in terms of between-subject variance and within-subject 
factors in terms of within-subject variance. In each case, the error 
variance was the final residual after all variables had been entered into 
the equation. ^ 

The unit of analysis was the individual subject's performance on 
individual sentences. There were 36 sentences and in the main analysis 
there were 259 subjects; hence, there were 932^ observations. The degrees 
of freedom for wi th i n-subjects tests of significance in the main analysis 
was N(K-1)-V = 8865, where jN is the number of subjects, K is the number of 
sentences, and V, which equalled 200, is the number of variables tested for 
significance. In the subsidiary analysis Involving the noncognitlve 
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characteristics of the children there were 161 subjects and Sk^S degrees of 
freedom. 

The nominal alpha level for each test of significance was .01. 
Significance was probably overestimated somewhat since the assumption of 
equal sentence by sentence intercorrelations was violated. Because about 
200 tests were performed, the expected number of false alarms was approxi- 
mately two. In order to avoid clutter, most effects will be referred to 
in the te^t simply as "significant" or "nonsignificant." The supporting 
detail for significant effects can be found in Table 2. 

The percentage of variance explained by within-subject factors may 
appear to be rather small. The value seems small because we did not 
aggregate across subjects to get a single average result for each sentence, 
which has been the standard practice in readability research (e.g., Bormuth, 
1966) and psycholinguistic research (e.g.. Just & Carpenter, I98O). Aggre- 
gation of results causes huge increases in proportion of variance that is 
apparently explained. However, this is a practice that requires the 
untenable assumption that there are no systematic individual differences 
among subjects," in particular that characteristics of subjects and attri- 
butes of tasi<s, settings, and materials do not interact. "Accounting for 
variance" is not the point of educat ionaT and psycholinguistic research. 
Rather, the goal Is to understand human comprehension, learning, and 
remembering as these processes may be conditioned by task, materials, and 
setting. 

The present study used a longer list of sentences than ?s customary "* 
in experimerTts with children. This was done in order to appraise what 
would happen when the novelty of, the experiment had worn off somewhat. An 
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undesirable side effect of a long list was that level of sentence recall 
was low, only 28.5%. An experiment is like a test in the sense that its 
discriminating power is greatest when the average level df performance is 
around 50%. When the level deviates substantially from S0% the relation- 
ships among variables can be distorted and spurious interactions can arise 
because of "ceiling" or, in this case, "floor" effects. 
^ Another problem when level of performance is low is that the absolute 
size of effects is constrained. This is not inherently a problem, except 
that it may lead th6 unwary to conclude that though an effect is significant 
its size is not large enough to be practical. The absolute size of an 
effect U a poor guide to decision mai<ing. A better basis would be pro- 
vided by getting a sense of how the experiment is "calibrated." This can * 
be done by examining the size of the effect produced by a "familiar bench- 
mari< variable. In-the present study the benchmari< variable might be 
performance on the standardized reading comprehension te^.* On the average, 
before any other- factors were consi-der-ed," each staniri^ Increase, in reading 
comprehension score resulted in a 2. 15% -increase in senWe recall. 
Considerirrg the -fuj 1 . range of performance on the,readirtg comprehension 
test, on the average a'child who scored at tHe 9th stanine recalled 19.4% 
more sentences. than a qhild who scored at the 1st stanine. 

i 

The remainder of this section is organized around the fi^e questions 
raised in the introduption. The answers. to these questions -are nbt inde- 
pendent. This means that 'it wi^l 1 'be necessary to^discuss some 'results " 
several times in different contex-ts. 
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Meaning Em phasis Versus Word Identification Emphasis 

The first question is, which is more effective in a reading lesson, an 
emphasis on meaning or an emphasis on accurate word identification? The 
answer from this study is that an emphasis on meaning significantly pro- 
moted mastery of the lesson, as evidenced by the children's recall. On the 
average children in groups that received a meaning emphasi? recalled 5.7% 
more sentences than children in groups that received a word identification 
emphasis. However, this finding has to be qualified because the teaching 
emphasis interacted significantly with three other factors. 

'■'The teaching emphasis Interacted with the readability of the 
sentences. Figure 1 shows that the easier the sentences the 
greater the advantage of a meaning emphasis. 

"The teaching emphasis interacted with average reading ability 
of the group. This interaction will be discussed in the next 
section on grouping. 

''The teaching emphasis interacted with the fluency of the child 
whose turn it was to read aloud. This interaction wi U be 
discussed in the section on turn taking. 



Insert Figure 1 about here. 
-• 1 J— 



It is most -interesting that the expected positive relationship between 
readability and performance was actually reversed in tne groups in wWch 
accurate word identi f ication was stressed (see Figure l). This is readily 
understandable when you consider the instructional procedure in these 
groups. To reiterate, the experimental teacher corrected each miscue and 
then the child read the sentence again. If there were still miscues the 
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Figure 1 



Percentage recalled as a function of teaching emphasis and readabili 
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whole procedure was repeated again until the child wjis able to give the 
sentence. a perfect reading. Thus, a difficnlt sentence provided a sub- - • 
stantial opportunity for feedbaci< and rehearsal. In contrast, an eas^ 
flawlessly read sentence whizzed by with little opportunity for learning. 

An important finding is that there was no interaction bet^ween the 
child's reading Ability and the teaching emphasis, F < 1.0. On its face 
this finding points to the "conclusion that a meaning emphasis gives better 
results than a word identification emphasis with third graders at every 
level of reading proficiency. Hc^ever, we are obliged to report that this 
conclusion is not entirely^ data driven. It hinges on the policy we adopted 
for doing the data analysis. This" policy caused us to enter the group 
ability by teaching emphasis interaction into the equation before entering 
the individual children's reading comprehension ability by teaching emphasis 
interaction. An alternate analysis showed that the interaction with the 
measure of individual ability is-nearly significant, H\ ,Zkh) = 7.22, 
.01 < p < .02, % Mar = 2.23, B = 1.07 when it is entered first. 

It should be stressed that the facts do favor giving priority to the 
group ability by teaching emphasis interaction. When this interaction is 
entered into the analysis first, the amount of variance explained by the 
indi'vidual abjiity by teaching emphasis ihteraction is nil, as we haye just 
explained. On the 'other hand, when the individual ability' by treatment 
interaction Is entered first, the group ability by treatment interaction is 
almost significant, H] ,zkk) = 6.08, .01 < p < .05, % Var = I. 85, B = 2. 5k. 
These facts suggest that everything in -the individual interaction term is 
also represented- in the group interaction term, but that the group term may 
Include something else of importance in addition. 
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Whether there is really an interaction between the level of reading - 

proficiency of the child and the type of teaching emphasis is an issue 

with important educational policy implications. It would be unacceptable 

to let a conclusion -about an issue 'of such importance depend upon the 

approach used to perform an analysis or upon what could, be undependably " 

fine nuances in a set of data.. Therefore, 'we took the position that the ' 

possible interaction of individual reading "comprehension ability with 

treatment had to be giC^en sef ious attention. 

# ■ * 

A graph .9f the possible interaction is presented in Figure 2. If 
the interaction is accepted as trustworthy, then the higher the. ability of 
the child the greater the advarftage of an emphasis on meaning. Conversely, 
when the ability of the child is low, there is little difference between 
treatments, or even a slight advantage for a word ideoti f ication emphasis. 



Insert Figure 2 about here. 



It must- be cautioned that the possible small benefit of a word identi- 
fication emphasis with very slow children shown in Figure 2 may be due to 
an artifact. The level of learning in this study was low because the list 
of sentences was long and average di<=ficulty was high. Thus, as we already 
explained, there may have. been a performance "fjoor." In other words, the 
performance of poor readers may have been- so low already that there was no 
room on the downside for even ^poorer performance to show itself in the word 
•identification condl^tion. Experiment 2 was designed to see whether there 
is a real benefit from a woed identification emphasis for children with low 
ability, or whether the possibhe slight advantage in this study was arr 
artifact. • - . ' 
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Figure 2 



Percentage recalled as a function of teaching emphasis an'd individual 
reading cdmprehension ability. 
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. The principal i^eason for the difficulty in determining whet her "the re 
was really an interaction between the individual child's reading ability 

> 

, and the reaching emphasis was the correlation ( r = .56) between Individual 
ability and average group ability. A simple strategem was used in ■• 
-Experiment 2 to unconfound the effects: The children were instructed 
individually, so the issue of group ability did not arise. 

Grouping for Instruction 1 ■ ^ 

The question' that we shall attempt to answer in tills section is whether 
grouping children according to ability Increases the effectiveness of 
reading lessons. Allow us to emphasize again that In this study we have ' 
defined effectiveness in terms of degree of mastery of a single day's 
lesson, as indicated by the percentage of the sentences from the lesson that 
the children were >able to recall. 

The measure of homogeneity was the standard deviation of the composite 
reading scores of the children in a group. This variable had absolutely 
no effect and did not enter into any Interactions. Havever, the mean level 
of reading ability of the groups did have a substantial effect and was 

involved in two significant Interactions- / 

/ 

'^As reported in the preceding section, grou/ ability interacted 
with the teaching emphasis. Figure 3 Ind/cates a strong negative - 
relationship between group ability and s^tence recall in groups ■ 
receiving a word identification emphasis', but no relationship in 
groups receiving a meaning emphasis-. / . 

*Group ability interacted with the relative interest of sentences 
to boys and girls. This int'eraction' is discussed in the section 
on mjterials. 
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Insert Figure 3, about here. 



To dispel some of the mystery about the negative relationship shown 
in Figure 3 between gro.up ability and sentence recall in the word Identi- 
fication groups, let us emphasize again that there was a positive relation- 
ship between the Individual child's reading ability and recall; however, 
regardless of a child's own ability he or she recalled more of the sentences 
when the average group ability was low. The principal reason this happened 
Is that the lower the ability of a group tne longer was the instructloral 
time. In groups receiving a word identification emphasis, but not those 
getting a meaning emphasis, instructional time was strongly related to 
lesson mastery. These facts are ambocied In the causal model diagrammed in 
- .Figure kA. The arrows mark causal paths. An ?rrow between two boxes 
indicates that the, first factor is hypothesized to be the cause, or part 
of the cause, of the second factor. Paths from unmeasured causes are not 
included. The structural equations for this model, written in unstandardized 
form, are as fol lows: 

ii3=%-^Pl3>il +P23><2 

iSi, = ii, + £34213 

2% =% +£15211 +Pi,5><i, 

where ii,,.X2, and are expressed in stanines, x^ is expressed in minutes 
' and Xr is expressed in percentage or recall. 

Insert Fi()ure about here. 



It is cisar that the model gives a good account of the dynamics in the 
groups that had a word identification emphasis. Each stanlne decrease in 



Tne Reading Group 
36 

Figure 3 

.Percentage recalled as a function of teaching emphasis and group ability. 
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Figure 4A 



IZl'^l hT^' °^ relations between reading abUfty. instructional 

time, and lesson mastery. 
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group ability cau-sed 2.7 minutes Increase in instructional time, 'and each 
minute increase It Instructional time caused a 1.03% increase in sentence 
recall. Thus, ons stanine change in group abilh:y led via an effect on 
time to a 2.7 x 1.03 = 2.78% change in lesson mastery. 

It is equally clear from Figure AB that the model does not capture 
the dynamics of meaning emphasis groups. Whereas decreases In group ability 
led to increases in instructional time, there was only a very slight con- 
nection between instructional time and sentence recall. The net effect was 
only a I.78 x .0? = All changl&in lesson mastery when there was a one 
stanine change in group ability. 



Insert Figure hi about here. 



The interesting conclusion that is clarified by the causal model is 
that lessons wich a word identification emphasis depend heavily on time for; 
their effectiveness: Instructional time makes little difference in meaning 
emphasis lessons; here it is the quality of the child's encounter wi th the 
lesson material tha.-. is important, not the quantity. 

Based on other research we may be reasonably confident that there is 
one aspect of these conclusions which will not general ize to natural class- 
room settings. In actual classrooms there is typically a pos 1 1 i ve relation- 
ship between functional instructional time, or "opportunity to learn," and 
group ability (Hiebert, I982). In the special circumstances that prevailed 
In the present study the experimental teacher allocated more time to make 
up for the slower pace in the low groups. Teacher effectiveness research 
shows that student ochievament is a function of the pace (that is, the rate 
at which the curriculum Is covered), allocated time, and the proportion of 
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Figure 



Partial causal model of relations between reading ability, instructional 
time, and lesson mastery. 
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allocated time In which the student Is actually engaged (cf. Berliner S 
Rosenshlne, 1977; Leinhardt, ZIgmond/s Coolly, I98I). Furthermore, class- 
room research suggests that the pace is slower in low ability groups 
(Beckerman & Good, 198l).and that, whereas allocated time is about the 
same in high and low groups, usable Instructional time and engagement rates 
are considerably lower in low. groups (cf. Allington, I98O). 

Thus, there is reason to doubt the overall generality of the results 
encapsulated in the model diagrammed m Figure i,A because we made sure all 
groups covered all of the material. However, there Is one aspect of the 
conclusions that desefrves dose attention from Instructional researchers. 
We know of no reason to doubt the generality of the finding that the 
success of meaning emphasis instruction depended very little on Instruc- 
tional time. This method supceods .because of the quality of the children's 
encounter with lesson materials. Could It be that the reason that instruc- 
tional time has received such prominence in recent years is that the 
methods employed by classroom researchers are capable of identifying 
instruction whose degree of success depends upon time but incapable of 
distinguishing methods whpse success depends upon qualitative charac- 
teristics? 

Turntaking During Instruction 

The principal question that we attempted- to answer is,, does the active 
participant (the child who is called on to read aloud and respond to the 
teacher's instruction-) get more from the lesson than the other participants 
(the remaining children in the group who are reading silently and 
listening)? The answer based on this study -Is "yes." Children achieved 
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significantly greater lesson mastery when they were taking a turn than when 
they were foVlowing along. The mean advantage was To our knowledge, 

this study provides the first direct evidence of the benefits of playing 
an active role, even though educators have always believed that active 
engagement is important. However, further research will be required to 
determine whether active engagement was really the operative factor. Since 
any one child's turns encompassed only a quarter of the sentences, these 
sentences were, therefore, more distinctive than the remaining three 
quarters.. Thus, distinctiveness could have been the operative factor' 
Instead of active engagement. 

There was one significant interaction Involving role: 

-Whether a child was taking a turn Interacted with the position 
of the sentence In the lesson. Figure 5 shows that when the 
child was the active participant performance improved through 
the course of the lesson, whereas performance of the'other 
participants remained about the same. 



Insert Figure 5 about here. 



One reason we used the predictable, "round robin" turntaking procedure 
was to make It obvious when a turn would begin so that we could determine 
whether there was. psychological preparation for turntaking that interfered 
with the processing of material then being covered. Specifically, we looked 
at recall of sentences that immediately preceded active turns. No. effect 
was observed, F < 1.0. Performance was indistinguishable from other 
sentences encountered when the child was playing a passive role. It should 
be noted that the physical layout of the mss rials made it difficult for a 
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Figure 5 ~ 

Percentage recalled as a function of turn taking and positron of sentence 
in lesson. 
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child to actually practice the sentences he or she was going to have to 
read aloud, because they were always printed on the next page. 

It could be that taking an active turn engages attention and that, 
once engaged, attention persists for a period after the turn has ended. 
This possibility was evaluated by examining performance on sentences that 
Immediately followed active turns. The finding was negative, F =1.37. ' 
Performance was no different from performance on other sentences encountered 
when the child was playing a passive role. 

The active participant's reading fluency was a salient characteristic 

that figured in two significant effects. The measure of fluency^ It will 

be r&called, was speed of reading /our lists of 15 words. The fluency of 

the oral reader had a significant negative effect on lesson mastery. On 

the average, there was a .8% decline in sentence recall for each stanine 

increase in the fluency of the oral reader. The oral reader's fluency was 

involved in a significant interaction: - 

"The fluency of the oral reader of the moment interacted with 
the teaching emphasis. Figure 6 indicates that lesson mastery* 
was a s ha rpl)^ decreasing function of the oral reader's fluency 
when there was a stress on word identification, but that the 
oral reader's fluency made less difference when there was a 
stress on meaning. K 



Insert Figure 6 about here. 



The negative effefct of the fluency of the oral reader in the word 
identification emphasis groups can be understood in terms of the causal 
model introduced earlier. Only now'we can refine the model. It Is the 
. . fluency of the oral reader that is critical, not other aspects of ability. 
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Figure 

•Percentage recalled as a function of teaching emphasis and the fluency of 
the oral reader. ' 
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Of course, there a moderately high correlation between reading fluency 
. and reading comprehension. In this study the correlation was .62. Often 

< A 

When variables are Intercorrelated it is impossible to identify the one 
which has the cdusal force; however, in this case a clear determination is 
possible. The critical fact \i this: The reading comprehension of the 
oral reader, when er^tered into tlie equation instead of the fluency of the 
oral reader, does not have a significant effect ( F .= l.l?), despite the 
correlation between the two measures. This proves that the fluency of the 
oral reader is the operative aspect of his or her reading ability. 

It is entirely reasonable that fluency, rather than comprehension, is 
,the importanl^^pect of children's ability when they are reading 'aloud in 
a reading group, at least when errorless performance is 'the goal. The 
teacher and the other children can readily distinguish a stumbling, halting 
rendition of ~a sentence from a smooth performance. -Comprelienslon, on the, 
other hand. Is ah Internal process with fewer manifestations perceptable 
to 1 Isteners. _ 

Figure 7^'presents an augmented causal model relating two aspects of 
reading ability, the fluency' of tlie oral, reader, instructional time, and 
le'son mastery. Notice that this mpdel differs from the one displayed in • 
Figure 4A in two respects. The first difference is' the separation of 
rending ability into two facets. The preceding paragraphs h^ave presented 
the reasoning and evidence for the claim, that fluency, not comprehension 
'ability, lies on the indirect path to lesson mastery mediated by Instruc- 
tlonal time. However, It Is comprehension, not fluency, tHat 1 los on the 
direct path. The evidence for this Is that when the fluency measure Is 
entered In the equation after the comprehension measure it has a nearly 
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significant negative relationship with lesson mastery. To state this in 
Other language, once the common component of the two tests ^as been 
factored out, the remaining^ trai t measured by the fluency test depresses 
performance. * ' • 



l^seVt Figure 7 about here. 



The second. difference between the ™del diagrammed in Figure and 
the augmented model is that fluency of the -oral reader, Vather than the 
group's abititV, is placed on the path mediated by instructional time. To* 
the extent that the fluency of. the children varies, 50 too wi>l the Instruc 
tional .time on the segments of the lesson during which the chilciren ace 

taking their turns! '"Regrettably, vie reccfrded "only total Ifesson time, \ ^ 

\ - . 

rather than time^on the si^gments encompass I ng the turns; thus, it is 

«• ^ « > 
impossible to provide quatiti tative 'estimates of the, parameters of the m^del 

set forth in Figure 7- Another problem is that the fluency of the oral 

reader is confounded wi th group' abi.l i ty. Either-the oral reader fluency " 

by treatment i nteract ion .or the group ability by treatment interaction, 

but not both, will -account for significant variance. This means that no 

strictly data-driven interpretation is possible. However, in this instance 

It ts our judgment that logic dictates that priority be given to the 

Interactfon involving the oral reader*s fluency. We believe, therefore 

that the model diagrammed in Figure 7 gives the best picture of the 

dynamics of the word identification emphasis groups under the conditions 

that prevailed in this study. Of course, the model does not fit the 

meaning emphasis groups. -> 
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Figure 7 



Augmented causal model relating reading abilities, instructional time,** and 
lesson mastery. , , ' 
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Materials 1 

The major question we asked about materials was, which is more 
Important, readability or interest? This study yields a resounding answer: 
"Interest." Interest accounted for 33.8 times as much variance in 
sentence recall as readability. This value does not Include the interactive 
and derivative effects of the two factors. When these are counted, interest 
explains four times as much variance as readability. . ' ■, 

The flacid effects of readability cannot be explained away because of 
restriction of range; In fact, the sentences ranged in readability from 
the first grade to the seventh grade level.. Thus, there is no escape from 
the qonclusion that Interest Is much more important than readability when 
the^cricerion is recall of information read and, '^of course, when there is 
a teacher available to iielp the children with the hard words. 

All of Lhe multivalued variables included in the study were checked 
to see if they had a curvilinear relationship with sentence recall. Only 
interest did, as is evidenced by the fact that the quadratic of rat^d 
interest was significant. As sentences move from uninteresting to 
interesting there is a sharp rise In performance, but the Increase is 
m^re gradual as the sentences move from interesting to very interesting. 
That the relationship is curvilinear is not especially noteworthy, since 
the Interest scale is arbitrary. 

The rated interest of sentences irtteracted with two other factors. 

'•'Interest interacted with the position of the sentence in 'the 
lesson. Figure 8 shows that interest was more Important for 
> ^ sentences appearing la.te in the lesson than It was for y 
sentences early in the lesson. 
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"Interest interacted with reading comprehension ability* Figure 9 
indicates that thare was a greater difference in the performance 
of high and low ability children on interesting sentences than 
on uninteresting sentences. 



Insert Figures 8 and 9 about here. 



It should be noted in passing that Figures 8 and 9 were constructed 
by selecting two extreme values of continuous variables. However, In each 
case the calculations to determine the best-fttting functions were 
performed on the entire array of 9,324 observations. 

We interpret the interaction of position and interest in the following 
fashion. An experiment Is a novelty, at least at the beginning. When the 
session starts the children are giving some of their attention to adapting 
to the unfamiliar teacher and the unfamiliar procedures. As the novelty 
wears off and they become more comfortable with che situation, factors'of 
Intrinsic importance such as the interest of the materials are likely to 
take hold:^ We beliSve the same explanation can also account* for the 
increasing impbrtance of active participation from the beginning to the 
end of the lesson (see Figure 5)^ If this interpretation is correct, it 
is reasopsblfe to infer that intere^^t and active participation would be even 
more Important^ under the daily routine of school than in a brief\^per iment. 

If one W6re to take the interaction between Interest and comprehension 
ability (Figure 9) at face value, it would seem to indicate that bright 
children are more sensitive to the interestingness of reading material than 
less able children. This conclusion does not agree with common observation. 
It seems possible, instead, that the interaction appeared in this study 
because of a performance floor which led to a distorted estimate of the 
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Figure 8 

irtheKessoT^'^^'' ^ function of interest and position of tlie sentence 




The Reading Group 
51 



Figure 9 



Percentage recalled as a function of Interest and reading comprehension. 
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relative performance of low'^bility children. The relationship between 
interest and ability was explored further in Experiment 2. 
5 It is a common practice to sprinkle colorful vignettes into children's 

social studies and science texts (Pearson, Gallagher, Goudvis, s Johnston, 
'.981). Often these vignettes have no conceptual relationship with the rest 
of the text. Apparently the theory is that interesting asides will attract 
attention and, once attracted, attention will be maintained for awhile, 
leading to better learning of the surrounding, less interesting material. 
We tested this theory by determining the influence of the interest of each 
sentence on the recall of the sentence which followed it in the lesson. ' 
There was absolutely no effect, F < T.O. Also investigated was the possible 
influence of the interest of a sentence on the recall of the sentence which 
immediately preceded it in the lesson. Again, there was no effect, F = 
2.27. Hence, there was no support for the theory that an interesting but 
unrelated piece of information will improve the learning of surrounding 
information. Evidently children switch attention on and off very rapidly. 
In fact, in related research (Anderson, I982) we have been unable to 
establish that amount of attention is even on the causal path between 
interest end learning. 

It is a reasonable conjecture that intrinsically interesting sentences 
increase learning by affecting the quality of children's processing. We 
have already uncovered evidence, discussed at length in preceding sections, 
that a teaching emphasis on meaning affects learning because of the quality 
rather than the quantity of processing. Hence, it is a plausible liypothesis 
that the influence of the two factors is mediated by the same underlying, 
qualitative process. An implication of this hypothesis is that the effects 
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of a meaning emphasis and interest wi-H not be additivej' since if the 
process has already been evol<ed by an interesting sentence having a meaning 
emphasis will be at least partially redundant. The results show, however, 
that the effects are additive, as evidenced by the failure of even a hint 
of an interaction to appear, F < 1.0. The conclusion is that the effects 
of a meaning emphasis and interest seem to be produced by different under- 
lying processes. 

Up to this point we have been discussing the total rated interest of 

sentences to third graders. We also investigated a derivative factor, 

differential interest to boys and girls, where a positive score means that 

the sentence was more interesting to girls and a negative score means that 

it was more interesting to boys. Of course, the absolute magnitude of the 

number indicates the size of the differential interest. Differential 

interest did not have a significant main effect, but it did enter into 

three significant interactions, as 'follows: 

"The differential interest of sentences to boys and girls inter- 
acted with the sex of the child. Figure 10 shows that 
performance was best when the orientation of the sentence 
matched the child's sex. 



insert Figure 10 about here. 



"The differential interest of sentences to boys and girls inter- 
acted with the teaching emphasis. Relatively speaking, a 
meaning emphasis was better with girl-oriented sentences while 
a word identification emphasis was better with boy-oriented 
sentences. 
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Figure 10 



Percentage recalled as a function pf sex and different 
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"The differential interest of , sentences to boys and girls inter- 
acted with group reading ability. Performance on girl-oriented 
sentences was relatively better in low ability groups than high 
ability groups whereas group ability made little difference on 
boy-oriented sentences. 

The interaction between differential interest and the sex of the child 
is not surprising, but we were gratified to see it appear because it repli- 
cated the findings of Asher and. his associates (see Asher, I98O) who used 
entirely different methods. There was also a nearly significant inter- 
action between total interest and sex, F(l,8865) = 5.30, .01 > p < .05, 
% Var = .06, B= 1.09, suggesting that boys may be more sensitive to 
interest than girls. This, too, is consistent with Asher's findings.. 

i 

We have no idea why differential interest interacted with the teaching 
emphasis and group ability. Perhaps these are cases whpse explanation 
would have become clearer if the sex of the child had b6en figured in; 
however, we were unwilling to open the Pandora's Box of higher order inter- 
actions. 

(t 

The findings regarding readability have already been reported. To 
review, readability did not have g significant main effect', but it did 
interact with the teaching emphasis (see Figure 1). 

in analyses that we wi 1 1 not report in detail, readability was repre- 
sented in terms of two factors, the length of each sentence in syllabl'es 
and the frequency of usage of the least frequent word in each sentence. 
These factors accounted for somewhat more variance than Fry readability 
level, but there was no important change in the conclusions. Hence, Fry 
readability level was reported since it is familiar to a wider audience. 
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There were two other significant aspects of the materials that have 
not been discussed yet. One was the rated degree of association between 
the subject noun phrase of a sentence, which served as the cue in the test, 
and the .rest of the sentence. • Obviously when the cue words makes it easy 
to think of the rest of the sentence recal 1 will be higher. Indeed, the 
degree of connection between the cues and the sentences turned out to be 
the most potent factor in the experiment. However, this fact Is of no 
general pedagogical or psychological interest. The factor was included 
In the analysis just In case It happened to be confounded with other factors 
such as readability or Interest, in which case serious misinterpretations 
would have been invited. Connectivity ratings were roughly orthogonal to " 
other factors and the method of hierarchical analysis adjusted for the 
slight biases that were present. Interaction terms involving connectivity ■ 
were not computed because they were of no theoretical or practical Interest. 

The final significant materials factor was the position of a sentence 
in the test. For each additional sentence there was a decline In 

sentence recall or a total decline of S.2% from the beginning to the end 
of the test. We attribute the decline to fatigue. The position of 
sentences in the test did not interact with any other factors. 

Chi Id .Characteri sties 

The general question raised in the introduction is whether the success 
of teaching methods, grouping arrangements, and materials depend upon the 
reading ability and other characteristics of the child. Most of the 
evidence bearing on this question has already been introduced and will 
merely be reviewed here. 
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There are just two child characteristics that had significant main 
effects— standardized -reading comprehension test score, and the.word reading 
fluency of the child taking an active turn. It is well-known that ethnicity 
is related to performance on school tasks, and this study Was no exception 
(ethnicity correlated .20 with sentence recall). However, we chose to 
enter ethnicity into the equation after the child's reading comprehension 
and reading fluency scores. In effect the question asked was, after taking 
account of a child's reading proficiency, can additional information be 
gleaned by looking at the child's skin color or listening to his or her 
dialect? The answer from this study was "no." Ethnicity did not produce 
a significant main effect and was not Involved in any significant inter- 
actions. The other child characteristics did not yield signifjcant main 
effects. These were word reading fluency (unless the child was the one 
reading aloud), sex, rated attention span, introversion-extroversion, 
anxiety, and internal locus of control. 

There were a total of five interac;;ions (including a possible one) 
involving chi Id characteristics. Four of these have already been mentioned 
as fol lows: 

'■'The fluency of the child taking a turn as the oral reader inter- 
acted with teaching emphasis. This interaction was discussed at " 
length in the section on homogeneous grouping. Figure 6 shows^v. 
that the oral reader's fluency had especially strong negative 
effpcts when there was a stress on accurate word identification. 

«The sex of the chJld interacted with the relative interest of 
the sentences to boys and girls. „ This interaction was reported 
in the section on materials. Figure 10 confirms that boys dfd 
better on boy-oriented sentences and girls did better on girl- 
oriented sentences. 
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^Reading comprehension ability Interacted with the interest of 
the materials. This interaction was discussed in the materials 
section. Figure 9 shows that there was a greater difference 
between high and low ability children on high interest sentences 
than on low interest sentences. 

"Finally there was a pqssible interaction between individual 
children's reading comprehension ability and the teaching 
emphasis. This Interaction was discussed in the section on 
teaching emphasis. Figure 2 suggests that the higher the 
child's ability the greater the advantage of an emphasis on 
meaning. 

One interaction not previously mentioned was also observed mvolvthg— . 

1 

a personality characteristic of the child, as follows: - 

I - - # 

''Mntrovers ion-extroversion Interacted with position of the 
sentence in the lesson,- F(l .sA'lS) = 8.93, £< .01, % Var = 
.1^, 5= .05. Introverts d\d better early In. the , lesson 
while extroverts did better late in the lesson. 

} 

We have no explanation for this interaction. A general conclusion Is 
that the yield from the noncognitive measures was low. , 

An Alternate Analy sis of Sentence Recall 

I , » 

In order to be sure that the findings were invariant over scoring 
procedures, the reduced model was run with the lenient, gist recall score 
as the dependent variable. The mean percentage of recall -that met lenient 
criteria was 38.6 and the standard deviation was 16.8. The findings were 
• essentially the same as in the preceding a^lysis, though some effects 
became stronger, and some weai<er. The following were the largest changes: 

Reading comprehension ability had less effect, £(1,2'»'») = ^1.89, 
£< .01, I VaF-= 6.9, B = Tbe c.uperiority of the meaning treatment 

■ 6^ 
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over the word identification treatment was greater, F{],2kk) =20.37, 
P< .01, % Var = 6. A, B = k.3k. Active turntlking made a larger difference, 
£(1,8865) = 67.U, £< .01, % Var = .69, b'= i,.39. The fluency of the 
oral reader had a stronger negative effect, £(1,8865) = 27.52, £< .01, ; 
% Var = .28, B = -1.25* 

l- 

The largest change between the two analyses was that Fry readability 
level now had a strong negative relationship with performance, F(l ,8865) = 
62. 7A, £< .01, % Var = .65, B= 1.83. The interpretation of this result . 
is that when the requi rement for a complete and precise answer is relaxed 
then the higher the grade level of the material the better is the recall. 



Performance on the Word IdentifTcation Test 

All of the results reported so far entail children's recall of 
sentences. Two other measures of lesson performance were the chi Idren''^ 
speed and accuracy in reading a list of 16 difficult words from the 
sentences. Number of errors and time to read the list in seconds were 
evaluated In analyses Involving the treatment factors, Indivldi\al differ- 
ence factors -(except the noncogniti ve factors), and all .the two-way 
interactions of the treatment and individual difference factors. 

The significant predi ctors 'of number of errors were the Reading 
comprehension test, F.{\ ,2kk) = 212.28, £ < .01, ^ Var = Al.OO, B = -2.52; 
the reading fluency test, F^{] ,2kk) = 22.31, £ < .oi, % Var = A. 31, B = • ' 
-.55; group ability, H\ ,2kk) = 16.85, £ < .01, ^ Var = 3.25, B = -2.i.9; 
and the individual reading comprehension ability by. group ability inter- 
action, £(1,2AA) = ^^2.A9, £<'.01,'^ Var = k.3k, B = .35. The mu'l tlple" ' 
correlation for the reduced model was .73. Whe.ther a child received a 
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meaning-, emphasis or word' identi f ication -emphasi s had abso'lutely no effect* 
on number of errors^ F< l.o', nor was the teaching emphasis involved In 
any Interactions. 

The significant predictors of time (in log seconds) to rfead the list 
were reading fluency, H\ ,M) = 1009.03, P < ~0l, % Var = 78.20, B = -.15; 
group ability, H\ ,2kk) = l6.8l,"p < .01, % Var. = I.30, B = -.03; and the 
reading fluency" by type of teaching emphasis interaction) H\,2hk) =| 7. 1 1 , 
£< .01, ^ Var'= .55, B = -.01. The mul t i pi e. correlat ion for the reduced 
model was .90. 

In Instruction, like the r^st of'th4 world,, you usua 1 ly j-get ..what you 
pay for. The obvio.us predi ctlpn was that the word identi f iCation^groups 

» 

would do better than the meaning' groups on the;lrst of hard words.' Whereas 
there was -ho ,di fference among groups in number of errors, the obvious ' 
prediction was partially confirmed In the case- af the time mea,sure,;,,.as 
can be seen from Figure 11, chlldren'who were below average ip 'reading 
fluency read the list of hard words faster if tfjey had reGei\7ed a word 
Identification emphasis. However, children of ^bove average fluency read 
the Jist at-about the same sceed whichever tr.eatment they had received. 
Thus; there was some evidence that receiving feedback on pronounciat ion 
and reading and rereading sentences improved automatlcity for below average 
readers. ' . , ' . . 



Insert Figure 11 about here, 



Experiment 2 ' 
The main purpose of the second experiment .was to see whether the 
principal findings of the«first study could be replicated. In particular, 
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• • Figure 11 

Time in Seconds (backtragsformed from log seconds) to read hard words 
a function of teaching emphasis and reading fluency. < 
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we were concerned to check the tenuous finding that che relative benefits 

of an emphasis on meaning depend upon the reading proficiency of the child. 

In the first experiment, an advantage for a meaning emphasis with less able 

children may have been obscured because of a performance floor. In the 

second experiment the list of sentences was shortened from 36 to l4.and 
* \ 

the words were made easier. The average Fry readability level dropped 

from ^.3 to 2.0; its range was reduced from 7.O to A. 5. As before, sentence 

interest was normed by another group of third grade children. Mo attempt 

was made to differentiate interest by sex this time. \ 

Meaning was emphasized in the same way as in the first study. That 

is, for each sentence the child was asked to elaborate on the sentence—to 

provide a continuation that told what might happen next. In order to 

Introduce .^r^ord identification emphasis, two difficult filler Sentences, 

upon which most children made some miscues, were included early in the 

lesson. This gave the experimental teacher the opportunity to dramatize 

that he or she wanted accurate, fluent pronunciation of every word. There- 

# 

after, almost all of the children were able to read almost all of the 

, J" 

senten-es with no trouble. Untie r both treatments, when a serious miscue or 
long hesitation did occur, the teacher supplied the word and the child kept 
going. 

There were several other differences from the first study. Children 
participated individually rather than in groups. This was done to get an 
unconfounded assessment of whether individual reading ability interacts 
with teaching emphasis. Two control conditions were included in addition 
to the meaning emphasis and word Identification emphasis treatments. These 
were a sllentv reading condition in which children were asked to read 
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carefully to themselves and a listening condition in which the -sentences 
were played from an audio tape. Finally, the measure of performance, 
recall of sentences, was modified in order to leave no doubt that compre- 
hension was. being assessed. The children were tested for recall using cues 
related inferential ly to the sentences contained in the lesson rather r.han 

words that, had appeared in the sentences. For instance, .'for the sentence 

II ' ' ' 

"The strong man chopped the wood," the cue was "axe." It is obvious that 

i 

recall of thi s; sentence woul d be unlii<ely unless it had been comprehended 
in the first place. The measures of speed and accuracy of reading difficult 
words from] the. sentences were omitted in this study because few hard words 
were used. 

I 

Design and Procedure i 
The subjects were 86 th| rd graders from Homewood, Illinois. The mean 
national stanine for the samiple was 6.3k and the standard deviation was 
1.73. Thus, the reading projitiency of the sample ranged from poor to 
very good W^ith more chi Idren gbove the national average than below. The 
children were taken from the i r\ class rooms one at a time. to receive the 
experimental lesson. They were^andomly assigned to the meaning emphasis 
condition, the word identi f icatii^n condition, the silent reading condition, 
or the listei;iing condition. \ 



After tl^e lesson the children Wre asked to read the. three lists of 
15 words as quickly as possible, skipping those which they could not read. 
These were onUlated to the sentences, used in the lesson, but yielded a 
measure of children's word reading fldency. Following this task the 
children were given a probe word, an uri,expressed instrument or instantiation 
of each of thejearller ]h sentences (not including the difficult filler 
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senteri,ces), and asked to tell the sentence it reminded them of. For 
instance, "The strong wind blew the roof off our house," the probe was 
the instantiation, "tornado."' In the earlier example, the cue, "axe," is 
the unexpressed instrument. 

The measure of reading comprehension was the comprehension subtest of 
the Metropolitan Achievement Test. It had been given by the classroom 
teachers five months before the study as a part of the school's regular 
testing program. 

Results and Discussion 

Overview of Analysis 

To facli;•^fl^e comparison with Experiment 1, a separate analysis of 
the meaning eiTiphasis and word identification emphasis condition was done 
first. Then an analysis of all of the data was completed. 

There were five between-subjects factors in the overall analysis. 
Entered first were reading comprehension ability and word reading fluency. 
Next came the treatment conditions entered as three orthogonal contrasts: 
TCI: Meaning emphasis versus the other three conditions; TC2: Listening 
versus word identification emphasis and silent reading; TC5: Word identi- 
fication emphas-s versus silent reading. The two-way interactions between 
the treatment conditions and reading comprehension ability and reading 
fluency were examined. 

Within-subject factors included the Fry readability level and rated 
interest of the sentences. Also included were four characteristics of the 
words which served as cues for recall: whether the cue was related to the 
sentence by an instrumental inference or an inference of Instantiation; a 
measure of the degree of association between the cue and the sentence; the 
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number of syllables in the cue; and frequency of usage of the cue. The 
latter three measures are of no Intrinsic interest. They were included to 
prevent variations in the difficulty of the test items from confounding 
the effects of readability and interest. In this study the measure of the 
association of the cues and sentences was the number of children in the 
meanin.3 emphasis group who included the cue word in their elaborations. 
All cwo-way interactions among the features of the cue. and sentences and 
between these features and the treatments and facets of reading proficiency 
were examined. The logic for computing statistics and assessing signifr- 
cance was the same as in Experiment 1. 

The interactions of treatment with other variables were computed by 
pooling the variance attributable to the interaction of each of the three 
. treatment contrasts with the other variable for a single test of signifi- 
cance. This is a conservative procedure which aci<nowledges that the 
grouping of treatments was arbitrary (and that other possible groupings 
would have been as well). If any interaction computed in this manner had 
proved to be significant, this would have been regarded as a warrant to 
explore the relationship In detail, but tone was significant. 

In an alternate analysis, the Fry readability level of each sentence 
was replaced with the mean frequency of usage (the Carroll, Davles, & 
RIchman, I971, Standard Frequency Index) of the content words, the frequency 
Df usage of the least frequent word, the total number of syllables in the 
sentence, and the mean number of syllables In content words. This analysis 
was somewhat more sensitive t.ian the one using Fry readability level; how- 
ever, none of the conclusions was different. Therefore, the analysis that 
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involved readability is reported here because the results will be under- 
standable to a broader range of people. 

Resul ts 

Meaning E mphasis Versus Word Identification Emphasis 

The meaning treatment proved to have a substantial advantage over the 
•word identification treatment,- F(l ,37) = 55.63, p < .01. The mean per- 
centage of sentence recall was 71.1% in the former case but only 39.8% in 
the latter. It .is especially noteworthy that in Experiment 2, unlike the 
first experiment, there was no "hint of m interaction between the type of 
treatment and the standardized reading comprehension score of the child, 
£(1,37) = 1.37 nor between treatment and readability, F < 1.0. It should 
be cautioned, though, that this experiment did not provide as strong a test 
as Experiment 1 of possible interactions, because the range of individual 
differences in comprehension and, particularly, the range of readability 
we.-e constrained. Nonetheless, it does seem reasonable to ir.fer that 
there was a performance "floor" in Experiment 1. 

An alternate interpretation of the difference between the two exper- 
iments is that the second one unconfounded individual and group ability. 
In the first experiment low ability individual children tended to be in 
■ low ability groups an^d low groups were allocated more time in which to 
learn the sentences; 

. The conclusion supported by both experiments is that a meaning emphasis 
gives better results with average and above average third-graders no matter 
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what the difficulty or interest of the materials^, length of the task, or 
grouping arrangements. Experiment 2 suggests, in addition, that" a meaning,' 
emphasis is also superior for poor readers, provided the tasi< is within 
their range. 

Comparison of All Treatments 

An average of ke.5% and 36-8% of the sentences were recalled in the 
silent reading and listening conditions, respectively. The figures for 
the meaning and wprd identification conditions, already reported, above, 
were 71.11 and 39.8%. The "meaning treatment gave much better results than 
the other three treatments, F(l,75) = 58.13, p < .01, % Var = 38.6, B = • 
22,6, whereas the other three treatments did not differ, F(2,75) = 2.19, 
p_> .10. 

figure 12 portrays sentence recall under each of the four treatments 
as a function of the r«ading comprehension stanines of the children. This 
figure is infcluded in order to allow an easy comparison with the results 
of Experiment 1 graphed in Figure 2. Figure 12 may appear to show an 
aptitude-treatment Interaction, but i t was not significant, F(3,75) = 1.63, 
jp > .10. The most striking fact revealed by Figure 12 is that the very 
poorest readers who received the meaning emphasis treatment performed 
better than the very best readers who received any of the other three 
treatments. 

Insert Figure 12 about here. 
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Figure 12 

Percentage recalled under each,of the four treatments as a function of 
reading comprehension. 




Reading Comprehension Stanine 
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Materials 



The aspects of materials that predicted recall were', first of all, the 
association between the cue word and the rest of the sentence, £(1,1093) = 
77.86, £ < .01, J Var = 6.01, B= I.03. The second significant predictor 
was another aspect of the cue,- the frequency of usage of the cue word, 
1(1,1093) = 47.03, p< .0], % Var = 3.63, B = -1.37. The more rare the 
word the better cue it made, a resuii consistent with the findings that 
Schnorr and Atkinson (1970), among others, have obtained with adult subjects. 

Interest had a strong relationship with performance, £(1,1093) = 57. 3^, 
P < ..01 , % Var = 4.43, B = 69.84. The quadratic of Interest was also 
significant, £(1,1093) = 14.96, % Var = 1.16,B = -23.25. Interest had 
approximately the same relationship with performance as it did in 
Experiment 1. 

Fry readability level did not influence performance, £( 1 , 1093) =^2.79, 
P > .05, % Var = .22, B = 2.20. Jn.fact, the direction of the relationship 
was wrong; there was a slight tendency for supposedly harder sentences to ~ 
be recalled better. Interest and the quadratic of Interest accounted for 
26 times as much variance as readability. The poor showing of readability 
in this study is perhaps not surprising, since all of the sentences were 
rather easy and the range was constricted. Nonetheless, the findings of 
the two experiments together leave no doubt that interest is much more 
Important than readability. 

'There was no significant Interactions in Experiment z. 
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Summary 'and General Discussion 
The major question this research was designed to answer was whether 
a meaning emphasis or a word identification emphasis gives better results 
In a third grade reading lesson. The answer was unequivocal. A meaning 
emphasis produced better results. Even in the first experiment, in which 
there was a complicating Interaction with the readabil'lty of the materials 
and a possible interaction with the reading proficiency of the children, 
there were very few sentences and very few children with which a word 
identification stress worked better. Furthermore, the second experiment 
suggested that these interactions werti probably spurious. The only clear 
advantage for the word Identification emphasis that appeared anywhere in 
this -esearch was that poor readers who got this emphasis were able to 
pronounce hard words from the lesson faster than poor readers who received 
the meaning emphasis. 

From the evidence on differential treatment of children In different 
ability groups, which Hiebert (1982) has summarized In an excellent recent 
review, it appears that low groups typically get an instructional regimen 
that resembles the word identi/ication emphasis used In the present 
research. Whereas no children are getting much comprehension Instruction 
(Durkin, 1978-79; Mason £ Osborn, 1-982; Nielson £ Rennie, 198I), Hiebert 
has summarized evidence showing that there is more attention to meaning in 
high groups. The present research does not indicate any need for nor 
advantage from differentia] treatment of high and Ijw groups. Though 
regrettably Experiment 1 left, some loose ends, both the pre/erred analysis 
of the data from this experiment and the data from Experiment 2 support the 
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conclusion that a meaning emphasis gets bitter results than a word identj- 
fi cat ion emphasis with poor readers as well as good readers. 

Perhaps one reason ehere is so little comprehension instruction in 
the typical classroom, particularly for poor readers, is that the schools 
operate on the presumption that once children have learned to decode they 
will comprehend by doing what comes naturally. This is a dangerous pre- 
sumption, one which is Inconsistent with tht data presented here, 
particularly the data from the second experiment. If "doing what comes 
nat&ral ly" were enough, the high ability children, at leas't, would have 
performed much better in the silent reading and listening conditions. The 
fact that instructions to elaborate on ^he presented sentences had a large 
effect on the performance of children of every ability level shows that the 
children were not sponta-neously engaging in meaningful processing. 

Other recent research points to a similar conclusion. Notably Hansen 
and Pearson (I982) found that second graders and fourth graders recalled 
dramatically more from basal reader-type stories when preparation for 
reading included questions and discussion designed to activate schemas 
appropriate for understanding the story content. The advantage was 
especially large for, poor readers. From this, Hansen and Pearson inferred 

» 

that good readers have already developed partialfy satisfactory compre- 
hension s.trategies and spontaneously use them some of the time, whereas 
most poor readers either don't have the strategies or don't use them 
unless prompted. It would appear from the Hansen and Pearson research 
that poor readers are in special need of instruction that will facilitate 
comprehension. Yet these are just the children whose attention is 
invariably directed to surface features of language. 

. . 77 . 
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While we do not know for^sure, It Is a reasonable conjecture that 
constant attention to the surface of le-guage will interfere with the 
development of a persistent tendency to read deeply. Getting meaning from 
any text beyond the simplest basal story requires understanding an Imposing 
array of linguistic devices and the development and consistent use of a 
variety of strategies for monitoring comprehension and organizing and 
Integrating Information (see Collins & Smith, 1982). It stands, to reason ' 
that poor readers are going to have a hard time developing these compe- ' 
tencles If their teacher always focuses their attention on the sounds words 
make. 

Educators are likely to be concerned, not only with whether effects 
are significant, but with whether they are large enough to be of practical 
value. The average advantage of a meaning emphasis In the first experiment 
was not large, but this had more to do with how the experiment was cali- 
brated than with the intrinsic power of the treatment. The second 
experiment showed that when the task Is within the range of the children, 
a meaning emphasis has a huge advantage over a word identification emphasis. 
In fact the poorest readers who received a meaning emphasis did better than 
the most able readers who received any other . treatment. Thus, this research 
confirms that the Law of Meaningful Processing holds for children under 
conditions that simulate classroom reading Instruction. 

The practical edutator Is also likely to worry that the teaching 
emphases employed In these experiments were more extreme than teachers 
would actually use, and that a good teacher would work on both word 
Identification, when needed, and meaning. Our reason for studying what 
might be regarded as extremes i's that, when previous data in an area are 
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'inconclusive and opinions are in conflict, it is often useful to design an 
experimeni- as one would write an opera: exaggerate characteristics beyond 
those that may be seen in nature; examine the 1 ill ting case'; and tal<e care 
to observe the full range of every variable. In a murl<y area, premature 

' -study of blends and shades may add to confusion about fundamental issues. 
The value of knowing for sure what will happen und6r limiting conditions 
ought not to be underestimated. 

The practical^ educator may also be concerned because the conclusions 
reached in this research are based on recall of material In one day's 
lesson. What would happen in the long run? ' Could it be that there are ' 

, latent benefits to a word i dent i fic,it ion emphasis that require a big 

investment in drill and practice In order to become manifest? We don't 

know the answer, but until someone finds out, for sure a good rule of thumb 
\ . . • 

is this: in general there is no reason to believe that an ineffective 
{ 

tt-eatment that loses in the'short run will become a winner In the long 
run if It is repeated day after day, week after week. ' 

Finally, we must address a question that will be on the minds of 
concerned members of the lay publit as well as professional educators. 
What^does this research have to say about the teaching of reading at the 
earliest stages? The answer is "very little." In particular, Mt has 
little bearing on whether systematic, .direct instruction in phonics should 
be a component of a beginning readi ng program. Of course, there is every ' 
reason to suppose that the Law of Meaningful Processing holds fpr six-' and 
, seven-year-olds as well as eight- and nine-year-olds and adults. Further- 
more, it seems highly probable that predominant attention to letter-sound 
correspondences would compete with meaningful processing, if only because 
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instructional time Is limited. This insight is not original nor is it 
especially controversial; "it is a point that would be acknowledged by any, 
reasonable advocate of systematic phonics instruction. The controversy (s 

* * 

about whether, once letter-sound principles are under control, chTldrep 

' , ' j 

will turn their main attention to meaning, as is hoped by the advocate of 

direct instruction, or whether, instead, once subjected to direct phonics 
instruction, many children become, trapped in a futile echo chamber of dis- 
embodied sounds, as^is feared by the advocate of the position that'letteV- 
sound' pa t'terns should be acquired as the incidental byproduct of efforts 
after meaning. The research reported here does not illuminate this issue. ■ 

We turn now'to a consideration of the other findings of the research. 
These will be treated more briefly. We will comment on just the highlights 
The Interesting finding about group ability was that it was negatively 
related to. sentence recall in the word identification groups. A two-part 
theory was proposed to explain thii. finding: - (a) the lower the ability of 
the group, the more instructional time the teacher allocated and (b) 
increased instructional time led to greater learning. With the data in 
hand, it was possible to prove tKa^this simple theory provides -a quite 
satisfactory explanation. However, another theory probably gives a better, 
more refined explanation. The augmented theory places th,e rtoding fluency 
of the child taking the active turn, instead of group ability, on the 
causal. path leading to enhanced sentence recall. Indirect evidence showed 
the plausibility of the augmented model ; however, available data did not 
permit a direct, quantitative test of its adequacy. 

'Probably neither of these models would give. a good account of the 
dynamics of learning in the typical classroom. Evidence from classroom 
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research suggests that teachers usually fix the amount of time that wiJl 
be allowed each group rather than fix the amount of material that will be ' 
covered, as the experimental teacher in these studies did. Putting this 
another way, our experimental teacher allowed more time to compensate for 
the slower pace in low groups. Apparently most teachers do not do this. 
Therefore, neither model will exp.lain children's performance in the typical 
classroom. More important, though, than whether any particular model 
provides a good account of classroom dynamics is the demonstration here 
and in other recent research (Leinhardt, Zigmond, & Cooley, I98I) that it 
is possible to formulate ^preci se models of aspects of reading instruction 
and, subject these njodels to rigorous, quantitative evaluation. 

..c^Jt^^^ children's mastery of the day's lesson did not depend on amount 
of instructional time when meaning was stressed is a most newsworthy 
finjiing. !t is consistent with the findings of basic research. At one 
time, when exp^rimei tal psychologists mainly studied lists of words and' 
nonsense syilab-les,' it was thought; that learning inevitably depended upon 
time (Cooper 6 Pantle, I967). Subsequent research suggests that this 
belief 'is false," most especially when the learner is induced to engage 
in deep semantic* processing (Craik & Watkins, 1973). 

, - To x:oncIude that learning from written material is not a time- 
Jepend-ent process whpn the learner is engage^ tn an effort aftjpr meaning 
is not ^tc} imply that 'no^time js required. Every reading process takes 
some time and th^ tfme^it takes depends,^ at least, upon the skill of the 
reader and the length of the material.* The fact that instructional time 
was unrelated to Sentence recall when there was a meaning emphasis suggests 

r 

that *eYffn;the po'ofest readers had enough. time for processing tn the fastest 
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moving of the medning-oi^iented groups* Therefore, the poor performance of 
the poor readers must have been u'ue to ineffective processing; it could not 
have been due to time limitations because, again, if it were, thfere would 
have been a positive relationship between instructional time and performance 
under the meaning emphasis condition. On the other hand, if poor readers 
generally had enough time, then presumably good readers usually had more 
time than the minimum they needed.^ In traditional theories, learning is 
characterized as a process of accretion. The strength or probability of 
'earning is assumed to increase continuously as a function of the time 
invested in rehearsal or elaboration. But no theory of this"sot-t will 
explain our results because the "strength** of the sentence representations 
of the good readers in slow moving, meaning emphasis groups would be 
predicted tD increase, and if this were happening instructional time would 
have been positively correlated with sentence recall. We propose the 
alternative theory that once the time requirements of preliminary processes, 
such as perceptual analysis, are satisfied the learning of the basic 
proposi tional units expressed in sentences is an all-ai:-once event triggered 
by the integration of meaning (see Auble, Franks , & Sorac!, 1979; - ^ 
Goetz, Anderson, S Schallert, 1981). After this event ha^ happened, 
further time is redundant. 

Probably niost pract tci ng educators will not be surprised by the fact 
that active turntaking has a fairly large effect. Perhaps what is^ 
surprising is that, to the best of our knowledge, no previous researcher 
has succeeded in demonstrating the benefits of an active role. tn keeping 
with the philosophy of the experiment-as-opera, a str ictly^-ordered turn- 
taking procedure was employed in this research so that it would be crystal 
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clear to the children when it was and was not the if turn. It stands to 
reason that more open turntaking procedures would dause more children to 
realize the benefits of active involvement more of the time. In support 
of this hypothesis, Au and Mason (I98I) have' recently reported that native 
Hawaiian-Polynesian children show higher* rates of engagement during a 
les,s[on when the children can enter the discussion whenever they wish, and 
overlapping and joint turns are permitted, than when the teacher enforces 
one-at-a-time turntaking. 

Earlier we pointed out that the failure to find an interaction between 
the teaching emphasis and interest must mean that these two.factors have 
Jiheir influence on different underlying facets of the reading process, for 
if the influence were on the same facet then the effects of the two factors 
together would be rellundant. Exactly the same logic applies to the failure 
to find interactions between active participation and the teacfiing emphasis 
and active participation and interest. Thus this research makes the prima 
facie case that these important factors affect different stages, or aspects 
of the reading process. 

The Interestlngness of the materials was a very powerful variable in 
these experiments. It was more important by an order of magnitude than 
readability, the criterion everywhere gsed to gauge the appropriateness 
of school materials. Indeed, the relationship between readability and 
performance was not even in the right direction in either experiment. In 
the worst case, involving the lenient gist scoring of sentence recall in 
Experiment 1, each grade level increase in "difficulty" on the Fry 
readability scale was associated with a 1.83% increase in sentence recall. 
Sentences rated at che seventh grade level, which supposedly are too 
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difficult for third graders, were recalled 12.8% more often than sentences 
rated at the first grade level. It Is important to understand why this 
happened and what it means. Artifactual explanations probably can be ruled 
out. Sentences with a wide range. of readability were used. Several other 
important aspects of the materials were manipulated in a design that made 
them approximately orthogonal to readability. The method of analysis 
insured that readability got just the weight it deserved other things 
being equal . 

Readability foraula'j have been sharply criticized fn recent years 
(e.g., Davison, I98:.. A chief complaint is that none of the formulas 
takes account of tiie interrelationships among sentences and, therefore, 
text that lacks cohesion or clear organization gets passed off as "readable" 
by chMdnin in a certain grade. Proponents of the readability formula tend 
to answer their critics obliquely, dismissing objections on the grounds 
that the facts indicate that the formulas "work." 

We wish to briefly sketch the claim that the facts that allegedly 
support the use of readability formulas are suspect because they are based 
on research that contains errors of design, analysis, and interpretation. 
A problem of analysis was mentioned earlier. Readability research has 
always followed the dubious practice of aggregating data across children, 
computing the performance of the mean second grader, the mean third grader, 
and so on for each of a wide range of text selections. Then the mean 
performance of the children at the different grades is predicted from 
features of the words and sentences in the texts using regression analysis. 
The basis for the claim that readability formulas "work" is that analyses 
of thisftype often explain 80^ to 90^, of the variance. However few people 
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are curious to explain the performance of the hypothetical mean third 
grader. What most educators and behavioral scientists really want is to 
understand diverse individual third graders. If this is the goal, then the 
proper analysis involves the whole array of answers of every child on every 
text, as we explained earlier. If the right analysis were done, the amount 
of variance explained would fa.l 1 precipitously and the reading field would 
hav^e a proper amount of humility about how well readability formulas 
actua 1 ly work. ^ 

A more damaging criticism of readability formulas' is that they provide 
only superficial indicators of the deeper reasons for reading ease or 
difficulty. The major factor is whether the reader possesses background 
knowledge adequate for assimilating a text. We hypothesize that vocabulary 
difficulty, the principal component of every formula, is primarily a proxy 
measure for background knowledge. Anderson and r.-ebody (1981; see also 
Freebody S Anderson, in press) used an example from the jargon of sailing 
to convey this point. A child who knows the meaning of the word "spinnaker" 
is quiteMikely to be able to understand sentences that do not even contain 
"spinnaker," such as, "The sloop jibed suddenly and the boom snapped across 
the cockpit." Obviously the underlying factor that facilitates or inhibits 
comprehension of this sentence is extent of knowledge of sailing. The 
general point is that tr.e jargon that goes with a topic is just the tip of 
the conceptual iceberg. 

In th.e studies in which readability formulas were validated, high- 
flown language always has been associated with subtle, abstract knowledge, 
•simple language with everyday themes. Over the years, the association 
probably has grown stronger because heavy controls on the readability of 
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children's texts have made the language in them unnaturally slinple. This 
means that the confounding of knowledge demands and language complexity 
, has been exacerbated in the recent studies (e.g., Bormuth, 1966; Coleman, 
1970) In which people place most faith. The result is that the formulas 
now In use egregiously overestimate the importance of surface features of 
language. Probably most third graders could get the gist of a story about 
a girl and her puppy even If it were dressed up in fancy language, whereas 
no amount of simplification of the language of an economics treatise would 
permit very many third graders to grasp the concept of the multiplier 
effect. If the foregoing arguments a.e correct, it I-; not surprising that 
readability had weak effects, trending negative. In the two studies 
reported in this paper, since the theme of every sentence was easily 
grasped-by third graders. 

Finally, In the research in which readability formulas were validated 
children read without the help of a teacher. Ironically, readability 
controls are heaviest, not In library books that children do read by them- 
selves, but In basal readers. These are almost always read under the 
direct supervision of a teacher who can help with difficult words. Under 
the word identification treatment in the present research, a mistake on a 
difficult word, far from being a problem, was an opportunity for learning 
(see, Figure 1). 

In addition to whatever strengths it may have, every program of 
research, Indeed, every approach to research, has its limitations. We 
grow Increasingly weary of the quarrelsomeness of proponents of educational 
research style:, who spend 'more time attacking other styles than they do 
providing answers to Interesting questions using the style that they are 
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touting. In mature areas of Inquiry, everyone accepts that there must be 
converging evidence from a variety of sources before important propositions 
can be regarded as proved, in these areas, it is wel 1 -understood that 
chains of inference are long and that it is seldom that any one discipline 
has the tools to forge every link. In this spirit, we readily acknowledge 
the limitations of the 4=esear-e4i reported here. The principal limitation is 
that the research lacks ecological validity: an unfamiliar teacher taught 
especially constituted groups of chMdren; sentences were used instead of 
texts; the measures involved mastery of the -ratenal in one day's lesson. 
This was basic instructional research which ought not to generalized 
directly to regular classroom instruction. We have research in progress 
attempting to test the major conclusions from the studies reported here 
in natural classroom settings. 

On the other hand research of the kind reported in this paper should 
not be undervalued. In these experiments, it was possible to study treat- 
ments rarely seen in nature, for Instance, lessons that were both meaning 
oriented and intensive and systematic. It was possible to study the 
independent effects of variables that are correlated In nature, for 
instance, the ability of the group and the relative emphasis the- teacher 
gives to meaning and word identification. It wcs possible to get a pre- 
liminary look at the causal dynamics in reading groups, a difficult if not 
impossible task in a naturalistic investigation. Probably the most dis- 
tinctive feature of the present research was that it was designed to 
Investigate the interplay of a large number of the factors that converge 
at given moments to determine whether particular sentences will be mastered ' 
or not. Because of this, it was possible to demonstrate that the child 
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tJking a turn was getting more than the other participants; and, It was 
'possible to make more subtle discoveries, such as that the benefits of 
active turntaking and the interest i ngness of sentences do not extend to 
the immedlatelv fol lowing material . Principally, though. It was possible 
to show that the Law of Meaningful Processing continues to operate over a 
diverse range of conditions and with both good and poor readers. 
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